perm filename LET.E2[AM,DBL]1 blob
sn#373546 filedate 1978-08-16 generic text, type T, neo UTF8
To: Dr. Eamon Barrett, NSF
From: Dr. Douglas B. Lenat, Carnegie-Mellon University
Date: July 7, 1978
Subject: Annual Review of Research (NSF #7704440)
July 1, 1977 -- June 30, 1978
The general subject I'm studying is Discovery: how are humans able
to beat what a priori seems an enormous search space, and continually
come up with viable, creative new syntheses? In particular, how does
a mathematician know which concepts to define, what kinds of
relationships to hunt for, how to go about proving conjectures he
suspects are true, etc.? The primary vehicle for this research has
been a theory (mathematicians are guided by a very large collection
of rules of thumb) and a computer model embodying it (the AM
program). But AM's limitation was that, as new ever-more-specific
concepts were defined and developed, new specific heurisitics were
not. The focus of this past year's research has therefore been: how
can we get our new program (Eurisko) to discover new heuristic rules
of thumb, rules which are specific to the domains that the program
itself discovers, rules which are powerful enough to effectively
guide the program's search as it explores those new fields.
The hypothesis we made is at once bold, risky, yet the obvious
first-order guess: we can walk around the space of heuristics using
pretty much the same knowledge we used to walk around the space of
math concepts. For instance, a useful heuristic to AM said "Given a
new function, pay special attention to what it maps extrema into".
Our hypothesis says that this can also be used to guide the search
for effective new heuristics: "Given a new heuristic, pay special
atention to what it maps extrema into". In the first case, the
extrema were simply extreme boundary examples of the domain of the
function; in the second case, extrema refer to extreme examples of
mathematical concepts (e.g., extremely useful ones, known frequent
counterexamples, etc.) The results to date are not yet conclusive; we
are still implementing. Very soon we should know how far off we were
in our hypothesis about the nature of the space of heuristic rules.
Let me mention how it is that we were led to this hypothesis, since
it is not a part of the original proposal made last year. For a long
period of time, we believed that the key lay in the codification of
a large number of "meta-rules" -- informal pieces of knowledge about
how to create and manipulate heuristics. But the more we studied
such meta-rules, the more elusive they became: there is very little
which one can say about heuristic rules that does not apply also to,
say, mathematical operations. E.g., "IF the thing being considered
is very costly to apply, that lowers its overall worth rating"; "IF
the thing sometimes produces good results and sometimes poor ones,
THEN try to find a simple discriminator that partitions the thing's
domain into two classes, most of the first producing good results,
most of the second producing inferior results; and define and focus
on that thing restricted to the former subset of its old domain".
The basic discovery, then, was that heuristics are not so unlike
mathematical functions. The heuristics relevant to walking
successfully around the space of mathematical functions might do an
adequate job of walking around the space of heuristics. But AM
already had a large collection of just such heuristics. The task
that was left was to rewrite those heuristics in a form that would
enable them to OPERATE UPON EACH OTHER. The solution was then quick
in appearing: represent each heuristic the same way in which AM
represents each mathematical function: as a fulf-fledged concept with
facets (a frame with slots, a unit with aspects, a property list with
properties). This has been the task of the past year,
re-representing all of AM's heuristics.
To date, there is one published paper which mentions this year's
research: "The Ubiquity of Discovery". This was the 1977 Computers
and Thought Award Lecture, and was reprinted in THe Journal of
Artificial Intelligence, Volume 9, Number 3, in December, 1977. It
was also an invited talk at -- and reprinted in the proceedings of --
the National Computer Conference, Anaheim, June, 1978. In addition,
this talk was an invited lecture at the Gordon Research Conference on
"Scientific Information Problems in Research", this month in New
Hampshire. Two copies are enclosed, reprints of the AI Journal
article.